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T cells specific for kidney carcinoma 

Description 

The present invention concerns new nucleic acid and 
amino acid sequences of the human T cell receptor and 
their use for the diagnosis and therapy of carcinomas in 
particular of kidney cell carcinomas. 

The T lymphocytes of the immune system are responsible 
for the cellular immune response. They are able to 
recognize and eliminate diseased body cells, e.g. cells 
which contain foreign proteins, or tumour cells. 
Diseased body cells are recognized by the so-called T 
cell receptor (TCR) which binds an antigen in the form 
of short peptide fragments which is specific for the 
diseased cell. These peptide fragments are presented by 
MHC molecules on the cell surface. 

T cell receptors are composed of two different 
polypeptide subunits, usually the so-called T cell 
receptor a or p chains which are linked together by a 
disulfide bridge. The a and p chains are in turn 
composed of variable and constant regions. The variable 
regions of the a chain comprise V and J gene segments 
and the variable regions of the (3 chain comprise V, D 
and J gene segments. 

The TCR a chain gene is composed of over 100 variable 
segments each of which contains an exon for a V region 
in front of which there is another exon which codes for 
a leader sequence which enables transport of the protein 
to the cell surface. A group of 61 J segments lies at a 
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considerable distance from the V segments. The J 
segments are followed by a single C segment for the 
constant region which in turn contains separate exons 
for the constant region and the hinge region as well as 
an exon for the transmembrane and cytoplasm regions. 

The TCR p chain gene contains a group of approximately 
30 V gene segments which are at some distance from 2 
separate clusters which each contain a single D segment 
and 6 or 7 J segments as well as a single C segment. 
Each constant segment of the P chain has separate exons 
for the constant, the hinge, the transmembrane and the 
cytoplasm region. 

During the development of the T cell the separate 
segments are linked by somatic recombination. In the 
case of the a chain a Va gene segment gets next to a Ja 
gene segment and hence a functional exon is formed. 
Transcription and splicing of the VJa exon to the 
constant region leads to the formation of the mRNA which 
is translated into the TCR a chain. The rearrangement of 
the Vp, Dp and jp gene segments coding for the variable 
domain of the P chain creates a functional exon which is 
transcribed and attached to Cp by splicing. The mRNA 
which forms is translated into the TCR p chain. The a 
and p chains join together after their biosynthesis to 
form an a : p TCR heterodimer. The highly variable 
region of the TCR which is responsible for the 
specificity of antigen recognition and is located in the 
linkage region of the V, (D) and J gene segments is 
referred to as the CDR3 region. 

Due to the high variability of T cell receptors it is 
very time-consuming to identify specific nucleotide and 
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amino acid sequences in particular in the area of the 
CDR3 antigen recognition region. There is therefore a 
great need to provide nucleic acid and amino acid 
sequences of T cell receptors which are able to 
specifically recognize clinically relevant peptide 
antigens in particular tumour-specific peptide antigens. 

According to the invention tumour-infiltrating 
lymphocytes (TIL) could be isolated from a kidney 
carcinoma which have a high specificity for tumour 
tissue from patients with the HLA-A*02 01 allele. These 
TIL show no reaction with healthy kidney tissue from the 
same patient. 



An analysis was carried out- of the nucleotide and amino 
acid sequences of the T cell receptors expressed by 
these TIL. In this process a homogeneous CD8 + T cell 
clone was firstly obtained by culturing and periodic 
restimulating the TIL over a period of 62 and 74 days 
respectively. The cDNA coding for the a and 0 chain of 
the T cell receptor was sequenced. The nucleotide and 
amino acid sequence of the a chain are shown in the 
sequence protocols SEQ ID NO. 1 and SEQ ID NO. 2. The 
CDR3a region in SEQ ID NO. 1 extends from bp 313 to 348 
corresponding to the amino acids 87-98 in SEQ ID NO. 2. 
The nucleotide and amino acid sequence of the 0 chain 
are shown in the sequence protocols SEQ ID NO. 3 and 
SEQ ID NO. 4. The CDR30 region in SEQ ID NO. 3 extends 
from bp 331 to 369 in SEQ ID NO. 3 corresponding to the 
amino acids 90-102. 



In the case of the a chain a combination of Va2 0 with 
Ja22 was found in the variable region and in the case of 
the 0 chain a combination of V022, D02 and J02 . 7 . 
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Subsequently a sequence analysis of the tumour-specific 
T cell receptors was carried out with a culture for only 
2 4 days. In this case a homogeneous T cell clone was not 
found but rather a mixture of several T cell species. 
The amino acid sequence shown in SEQ ID NO. 2 as well as 
in all two further amino acid sequences were able to be 
identified for the a chain. 11 out of 56 examined T cell 
species coded for the amino acid sequence shown in SEQ 
ID NO. 2 of the CDR3ot region from position 87 to 98. the 
nucleotide sequence of the a chains in these T cells 
differed from the sequence shown in SEQ ID NO. 1 only by 
a substitution of T by G at position 324. 

The nucleotide and amino acid sequence of the CDR3 
region of a further a chain which was identified in 38 
out of the 56 examined T cells is shown in the sequence 
protocols SEQ ID NO. 5 and 6. In addition two further 
T cell species were identified which contained a CDR3a 
region with the same amino acid sequence to that shown 
in SEQ ID NO. 6 but whose nucleotide sequence each 
differed by a base substitution (C at position 9 
substituted by G or T at position 12 substituted by C) . 

The nucleotide and amino acid sequence of the CDR3ct 
region from a third T cell variant which occurred at a 
frequency of 5 out of 56 examined T cell species is 
shown in the sequence protocols SEQ ID NO. 7 and 8. 

The corresponding sequencing of the p chains yielded a 
total of 6 different amino acid sequences for the CDR3 
region. A CDR30 sequence which was found in 15 out of 50 
examined T cells is shown in the sequence protocols 
SEQ ID NO. 9 and 10. A further T cell species contained 
the same amino acid sequence but a different nucleotide 
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sequence (substitution of A at position 15 by T) . 

One T cell species in each case contained the nucleotide 
and amino acid sequences shown in the sequence protocols 
SEQ ID NO. 11 and 12, 13 and 14 or 15 and 16 in the 
CDR3p region. 

27 out of 50 clones contained the nucleotide and amino 
acid sequences shown in the sequence protocols 17 and 18 
in the CDR3p region. 4 out of 50 examined clones 
contained the nucleotide and amino acid sequences shown 
in the sequence protocols SEQ ID NO. 19 and 2 0 in the 
CDR3p region. 

In addition an in situ sequencing of TIL was carried out 
i.e. a sequencing without prior culture. For this the 
entire RNA was isolated from the tumour, a TCR-specific 
cDNA was prepared using a TCRot- or TCRp-specif ic primer 
and reverse transcriptase and this cDNA was selectively 
amplified by PCR using family-specific primers (Vot2 0 and 
V|322) . The amplification products were cloned into 
E. coli and sequenced. In this process a series of 
single sequences was obtained. 

Circa 60 % of all sequences of the a chain correspond to 
the amino acid sequences shown in the sequence protocols 
SEQ ID NO. 2, 6 and 8. A further 20 % had very similar 
sequences which were also composed of a combination of 
Vot20 and Ja22. An overview of the CDR3a regions 
identified in this in situ sequencing of T cells from 
patient 2 6 is shown in Fig. 1. 

Furthermore it was found in the in situ sequencing that 
ca. 70 % of all sequences of the (3 chain correspond to 
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the amino acid sequences shown in the sequence protocols 
4, 10, 12, 14, 16, 18 and 20. An overview of the CDR3 
sequences of the p chain identified in the in situ 
sequencing is shown in Fig. 2. 

In a control experiment TIL from another patient with 
the HLA-A*02 01 allele were analysed by in situ 
sequencing. It was found that the CDR3a regions of 15 
and 4 of the total of 3 4 examined T cell species 
contained the amino acid sequences shown in SEQ ID NO. 2 
and SEQ ID NO. 6. An overview of the relevant CDR3a 
sequences and their frequency is shown in Fig. 3. An 
overview of the results which were obtained when 
sequencing the CDR3 regions of the 3 chain is shown in 
Fig. 4. 

Hence a first aspect of the present invention concerns a 
nucleic acid which codes for the a chain of a human T 
cell receptor, a functional derivative or a fragment 
thereof and comprises a CDR3 region composed of a 
combination of a Va2 0 gene segment and a Ja22 gene 
segment. The length of the amino acid section coded by 
this CDR3 region is 11-14 amino acids and preferably 12 
or 13 amino acids. The CDR3 region particularly 
preferably codes for one of the amino acid sequences 
shown in the sequence protocols SEQ ID NO. 2, 6 and 8, a 
sequence that is at least 80 % and in particular at 
least 90 % identical to this or a sequence which codes 
for an amino acid sequence with an equivalent 
recognition specificity for the peptide component of the 
T cell receptor ligand. 

A further aspect of the present invention is a nucleic 
acid which codes for the a chain of a human T cell 
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receptor, or for a functional derivative or a fragment 
thereof and comprises a CDR3 region selected from: 

(a) a nucleotide sequence coding for the amino acid 
sequence 

Y C L (X^.-Xn) S A R Q L T F (I) 

in which X x . . . x n represents a sequence of 3-5 
amino acids, 

(b) a nucleotide sequence which codes for an amino 
acid sequence which is at least 80 % and in 
particular at least 90 % identical with the amino 
acid sequence from (a) or 

(c) a nucleotide sequence which codes for an amino 
acid sequence with an equivalent recognition 
specificity for the peptide component of the 

T cell receptor ligand. 

The amino acid sequence X ± . . . X n is preferably selected 
from the group comprising the amino acid sequences VGG, 
VLSG, ATG, VSG, DSG, WSG, ALAG, APSG and VGR. The amino 
acid sequence X x ... X n is particularly preferably 
selected from the amino acid sequences VGG, VLSG and 
ATG. 

A particular feature of the tumour-specific CDR3a 
regions of the invention is a length of 12-13 amino 
acids and a common sequence motif. Thus if the sequence 
x l • • • x n has a length of 3 amino acids X x is preferably 
V or A, X 2 is preferably T, G or S and X 3 is preferably 
G. If the length of the sequence X x ... X n is 4 amino 
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acids then preferably X 2 = V or A, at least one of X 2 or 
X 3 is T or S and X 4 = G. 

A sequencing of the p chains from both patients that 
were examined yielded a combination of the gene segments 
Vp22, Dpi or Dp2 and JP2 . 7 for the first patient and a 
combination of the gene segments Vp22, Dpi or DP2 and jp 
2,1, Jp2.3 or JP2.7 for the second patient. 

Hence a further aspect of the present invention is a 
nucleic acid which codes for the P chain of a human T 
cell receptor, or for a functional derivative or a 
fragment thereof and comprises a CDR3 region which is 
composed of a combination of a Vp22 gene segment of a 
Dpi or Dp2 gene segment and of a Jp gene segment in 
particular of a Jp2.1, Jp2 . 3 or Jp2.7 gene segment. 

The length of the amino acid section coded by this CDR3P 
region is 12-14 amino acids, preferably 13 amino acids. 
Furthermore this CDR3p region preferably contains a 
common sequence motif i.e. X-T or S-X-S in which X 
represents an arbitrary amino acid and T or S 
particularly preferably denote T. A total of 70 % of the 
examined T cell receptors have such a sequence pattern. 

Yet a further aspect of the present invention is a 
nucleic acid which codes for the p chain of a human T 
cell receptor, or for a functional derivative or a 
fragment thereof and comprises a CDR3 region which is 
selected from: 
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(a) a nucleotide sequence coding for the amino acid 
sequence 

C A (X»! . . . X» n ) Y/D E Q Y F (II) 
in which X' x . .. X' n represents a sequence of 5-7 
amino acids, 

(b) a nucleotide sequence coding for the amino acid 
sequence 

C A (X"! ... X" n ) N E Q F F (III) 

in which X • » ± ... x ■ ■ n represents a sequence of 
5-7 amino acids, 

(c) a nucleotide sequence coding for the amino acid 
sequence 

C A (X' ■ «! ... X' " n ) D T Q Y F (IV) 

in which x» » ^ ... X' ' » n represents a sequence of 
5-7 amino acids, 

(d) a nucleotide sequence which codes for an amino 
acid sequence that is at least 80 % and in 
particular at least 90 % identical with an amino 
acid sequence from (a) , (b) or/and (c) , or 

(e) a nucleotide sequence which codes for an amino 
acid sequence with an equivalent recognition 
specificity for the peptide component of the 

T cell receptor ligand. 



The amino acid sequence X 1 ± ... x' n is preferably 
selected from the group comprising SSETNS, SSETSS, 



- 10 - 



TSGTAS , RSGTGS, SSGTDS, SSGTRS, SSGSDS, SSSTGS, SSSTVS, 
SSSTLS, SSSTLF, SSSTAS, SSHTDS, SSDTLS and SRWDSE. The 
amino acid sequence X 1 ! ... X' n particularly preferably 
represents SSETNS, SSGTDS, TSGTAS or RSGTGS. The amino 
acid sequence X'^ ... x»' n preferably denotes SSGTSSY 
or SSDQGM. The amino acid sequence X 11 ^ ... X 1 1 ' n 
preferably denotes SADSFK. 

Within the sense of the present invention the term 
"functional derivative of a chain of a human T cell 
receptor" is understood as a polypeptide which comprises 
at least one CDR3a or/and CDR3p region as defined above 
and together with the respective complementary chain of 
the human T cell receptor (or a derivative of such a 
chain) can form a T cell receptor derivative which has 
an equivalent recognition specificity for a peptide 
ligand presented by a MHC molecule to that of the non- 
derivatized T cell receptor. Such a T cell receptor has 
a binding constant of at least 1CT 4 1/mol, preferably 
10~ 4 to 1CT 5 1/mol for the presenting peptide ligand. 

Functional derivatives of chains of a human T cell 
receptor can for example be prepared by deletion, 
substitution or/and insertion of sections of the gene 
coding for the respective polypeptide by means of 
recombinant DNA techniques. The preparation of 
recombinant T cell receptor chains is for example 
described in Blank et al. (1993), Eur. J. Immunol. 23, 
3057-3065; Lin et al. (1990) Science 249: 677, Gregoire 
et al. (1991), Proc. Natl. Acad. Sci. USA, 88: 8077; 
Kappes and Tonegawa (1991), Proc. Natl. Acad. Sci. USA 
88: 10619 and Ward (1991), Scand. J. Immunol. 34: 215. 
Explicit reference is herewith made to these literature 
citations . 
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Particular preferred functional derivatives of T cell 
receptor chains or T cell receptors are single chain T 
cell receptors which can for example be composed of the 
variable domains of the a and P chain and a constant 
domain. The preparation of such constructs is described 
by Chung et al. (1994), Proc. Natl. Acad. Sci. USA 91: 
12654-12658. A further preferred example of functional 
derivatives are soluble TCR fragments which can be 
prepared as separate polypeptides or as single chain 
polypeptides cf . e.g. Hilyard et al. (1994), Proc. Natl. 
Acad. Sci. USA 91; 9057-9061. Explicit reference is also 
made to the disclosure in these literature citations. 

A further subject matter of the present invention is a 
vector which contains at least one copy of a nucleic 
acid according to the invention. This vector can be a 
prokaryotic vector or a eukaryotic vector. Examples of 
prokaryotic vectors are plasmids, cosmids and 
bacteriophages. Such vectors are described in detail in 
Sambrook et al., Molecular Cloning. A Laboratory Manual, 
2nd Edition (1989), Cold Spring Harbor Laboratory Press, 
in chapters 1-4. The prokaryotic vector is preferably a 
plasmid. 

On the other hand the vector can also be a eukaryotic 
vector e.g. a yeast vector, a plant vector (bacolovirus) 
or a mammalian vector (a plasmid vector or a viral 
vector) . Examples of eukaryotic vectors are described in 
Sambrook et al., Supra, chapter 16 and Winnacker, Gene 
and Klone, "Eine Einfiihrung in die Gentechnologie" 
(1985), VCH "Verlagsgesellschaft" in particular in 
chapters 5, 8 and 10. 

Yet a further subject matter of the invention is a cell 
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which expresses a nucleic acid according to the 
invention or a cell which is transformed with a nucleic 
acid according to the invention or with a vector 
according to the invention. The cell can be a 
prokaryotic cell (e.g. a gram-negative bacterial cell, 
in particular E. coli) or a eukaryotic cell (e.g. a 
yeast, plant or mammalian cell) . Examples of suitable 
cells and methods for introducing the nucleic acid 
according to the invention into such cells may be found 
in the above literature references. 

A further subject matter of the present invention is a 
polypeptide which is coded by a nucleic acid according 
to the invention. The polypeptide particularly 
preferably contains the variable domain of the a or/ and 
3 chain of a human T cell receptor. 

A polypeptide is particularly preferred which has T cell 
receptor properties and is composed of a TCRot chain or a 
functional derivative thereof as well as a TCR0 chain or 
a functional derivative thereof as subunits. The 
polypeptide can be composed of two separate chains or be 
present as a single chain polypeptide. In addition the 
polypeptide may also be present in an oligomerized form 
in which at least 2 and preferably 2-10 TCRot and TCR0 
chains are linked together. The linkage can for example 
be achieved by means of bifunctional chemical linkers. 

Yet a further subject matter of the present invention is 
an antibody against a polypeptide according to the 
invention which is directed towards a region of the 
polypeptide which is responsible for recognizing the 
peptide ligand. This antibody can be a polyclonal 
antiserum, a monoclonal antibody or a fragment of a 
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polyclonal or monoclonal antibody (e.g. a Fab, F(ab) 2 , 
Fab 1 or F(ab') 2 fragment). The antibody is preferably 
directed towards a CDR3 region of the polypeptide or an 
area thereof. Such antibodies can be obtained by well- 
known methods by immunizing an experimental animal with 
a peptide or polypeptide which contains a CDR3 region 
according to the invention and isolating the resulting 
antibodies from the experimental animal. Monoclonal 
antibodies can be obtained by fusing an antibody- 
producing B cell of the experimental animal with a 
leukemia cell according to the method of Kohler and 
Milstein or further developments thereof. Specific 
examples of the production of such antibodies can be 
found in Choi et al. (1991), Proc. Natl. Acad. Sci. USA 
88: 8357-8361 and Zumla et al. (1992), Hum. Immunol. 35: 
141. 

Yet a further subject matter of the present invention is 
a T cell which contains a T cell receptor according to 
the invention. Such T cells can be isolated from 
patients with kidney cell carcinoma and then be expanded 
in vitro. For this the peripheral mononuclear blood 
cells of a patient can for example be produced by 
stimulation with suitable antigens and subsequent 
restimulation for example with an irradiated autologous 
lymphoblastoid cell line, tumour cells, lymphoblastoid 
cells plus antigens or autologous peripheral blood 
lymphocytes plus antigen. Further methods for obtaining 
T cells according to the invention are described below. 

The invention also concerns a pharmaceutical composition 
which contains a nucleic acid, a polypeptide, a peptide 
ligand capable of binding to the polypeptide optionally 
in association with a corresponding MHC molecule, an 
antibody or a cell as described above as active 
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components optionally together with other active 
components as well as common pharmaceutical auxiliary 
substances, additives or carrier substances. Examples of 
other active components are accessory stimulating 
components e.g. cytokines such as IL-2 and IL-4 . 

The pharmaceutical composition can be used to produce a 
diagnostic or therapeutic agent. Examples of diagnostic 
applications are the diagnosis of tumour diseases or a 
predisposition for tumour diseases. A further preferred 
diagnostic application is the monitoring of the disease 
course in a tumour disease e.g. after chemotherapy or a 
surgical operation. 

The use of the pharmaceutical composition as a 
diagnostic agent preferably comprises the detection of a 
T cell subpopulation which expresses a polypeptide 
according to the invention as a T cell receptor. The 
detection of this T cell receptor can for example be 
achieved at the nucleic acid level e.g. by a nucleic 
acid hybridization assay optionally with a prior 
amplification. On the other hand the detection can also 
be carried out at the protein level by an immunoassay 
using antibodies that react specifically with the T cell 
receptor. In addition it is also possible to detect the 
T cells for example by means of a test for binding to 
specific peptide ligands or in an activity test in which 
the specific cytotoxic action of the T cells or the 
release of cytokines such as TNF or IFNy is determined. 

Furthermore the pharmaceutical composition according to 
the invention can also be used therapeutically in 
particular for the prevention or therapy of a tumour 
disease e.g. of a kidney cell carcinoma. This 
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therapeutic application can for example be based on the 
fact that T cells which express the tumour specific T 
cell receptor are stimulated to grow in vitro or in 
vivo. The growth stimulation in vivo can for example be 
achieved by administering the peptide ligand of the T 
cell receptor or/and the whole molecule from which the 
peptide ligand is derived or a fragment thereof. 
Furthermore the growth stimulation in vivo can also be 
accomplished by administering an antibody which 
specifically activates the T cell receptor by binding 
e.g. a monoclonal antibody or a monoclonal antibody 
fragment. 

On the other hand the growth stimulation of the T cells 
can also be carried out in vitro for example by 
isolating specific T cells from the patient, in vitro 
expansion and subsequent administration of the expanded 
T cells as a tumour vaccine. T cells which express a 
tumour specific T cell receptor are isolated from a 
patient preferably by contacting a sample from the 
patient which contains T cells, e.g. a blood sample and 
preferably a sample derived from the tumour tissue, with 
an agent which specifically binds to the CDR3 region of 
the T cell receptor, identifying the T cells which react 
with the agent and optionally separating them from other 
T cells. The agent that binds to the CDR3 region of the 
T cell receptor is preferably selected from the peptide 
ligand of the T cells, a peptide ligand-MHC complex 
or /and an anti-TCR antibody. Optionally the in vitro 
expansion can additionally be carried out in the 
presence of costimulatory factors such as anti-CD28 
antibodies. In order to facilitate separation of the 
desired T cell subpopulation, the agent is preferably 
used in an immobilized or immobilizable form. 
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The isolation of T cells which express a tumour specific 
T cell receptor can, however, also be achieved in 
another manner e.g. by introducing nucleic acid 
sequences which code for the T cell receptor into a T 
cell line, preferably a cytotoxic T cell line. The T 
cell receptor is then expressed in this transfected T 
cell line. In this manner it is possible to obtain T 
cells in large amounts which express a tumour specific T 
cell receptor . 

Yet another method for isolating T cells which express a 
tumour specific T cell receptor is to introduce nucleic 
acid sequences which code for the T cell receptor into 
the germ line of an animal and to isolate the T cells 
from the resulting transgenic animal or its descendants. 
Transgenic mice are preferably produced. Furthermore it 
is preferred that the transgenic mice also express the 
human CD8 molecule or/and the human HLA-A*0201 molecule 
in addition to the T cell receptor. 

Hence a further subject matter of the present invention 
is also a transgenic animal which has T cells that 
express a tumour specific T cell receptor. This 
transgenic animal is preferably a rodent in particular a 
mouse. 

Finally the invention also concerns a method for the 
identification of peptide ligands of a T cell receptor 
according to the invention. This method preferably 
comprises the steps: 

(a) isolating RNA from tumour tissue, 
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(b) converting the RNA into double-stranded cDNA 
molecules, 

(c) introducing the cDNA molecules into host cells to 
obtain a cDNA bank, 

(d) transfecting eukaryotic recipient cells with 
aliquots of the cDNA bank in which (i) there is 
a cotransfection with HLA-A*0201 DNA or (ii) 
HLA-A*02 01-positive recipient cells are used, 

(e) testing the transfected recipient cells for their 
ability to secrete cytokines such as TNF in which 
case it is possible for example to examine the 
lysis of TNF-sensitive cells, 

(f) identifying a cDNA sequence which codes for the 
antigen which contains the peptide ligand and 

(g) identifying the sequence of the peptide ligand. 

The invention is further elucidated by the following 
examples, figures and sequence protocols. 

SEQ ID NO. 1: shows the nucleotide sequence of the TCRot 
chain of a T cell receptor according to 
the invention in which bp 55-324/3 25 codes 
for the TCR-V0C2 0 gene segment, bp 325/326 
codes for the TCR Ja22 gene segment, 
bp 381-804 codes for the TCR-Ca gene 
segment and bp 805-1341 represent a 
3' untranslated region, 

SEQ ID NO. 2: shows the amino acid sequence of the 



- 18 - 



nucleotide sequence shown in SEQ ID NO. 1, 
SEQ ID NO. 3: shows the nucleotide sequence of the TCRP 
chain of a T cell receptor according to 
the invention in which bp 1-63 code for 
the leader peptide, bp 64-341 code for 
the TCR-VJ322 gene segment, bp 342-345 are 
nucleotides, bp 34 6-349 code for the 
TCR-Dp2 gene segment, bp 350 is an 
N-nucleotide, bp 351-398 code for the 
TCR-Jp2.7 gene segment and bp 399-936 code 
for the TCR-Cp gene segment, 
SEQ ID NO. 4: shows the amino acid sequence of the 

nucleotide sequence shown in SEQ ID NO. 3, 

SEQ ID NO. 5 

and 6 show nucleotide and amino acid sequences 

of the CDR3a region of a T cell receptor 
according to the invention 

SEQ ID NO. 7 

and 8: show nucleotide and amino acid sequences 

of the CDR3a region of a T cell receptor 
according to the invention 

SEQ ID NO. 9 

and 10: show nucleotide and amino acid sequences 

of the CDR3p region of a T cell receptor 
according to the invention 

SEQ ID NO. 11 

and 12 : show nucleotide and amino acid sequences 

of the CDR3P region of a T cell receptor 
according to the invention 

SEQ ID NO. 13 

and 14 : show nucleotide and amino acid sequences 

of the CDR3p region of a T cell receptor 
according to the invention 

SEQ ID NO. 15 

and 16: show nucleotide and amino acid sequences 
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of the CDR3p region of a T cell receptor 
according to the invention 

SEQ ID NO. 17 

and 18 show nucleotide and amino acid sequences 

of the CDR3p region of a T cell receptor 
according to the invention 

SEQ ID NO. 19 

and 20: show nucleotide and amino acid sequences 

of the CDR3p region of a T cell receptor 

according to the invention 
SEQ ID NO. 21 shows the nucleotide sequence of the TCRot- 

specific primer P-CaST, 
SEQ ID NO. 2 2 shows the nucleotide sequence of the TCRp- 

specific primer P-CpST 

Fi 9- 1 shows nucleotide and amino acid sequences 

of the CDR3a regions from tumour-specific 
TCR which have been determined by in situ 
sequencing of T cells of patient 26 , 
Fi 9- 2 shows nucleotide and amino acid sequences 

of CDR3p regions of tumour-specific TCR 
which have been determined by in situ 
sequencing of T cells of patient 26, 
Fi 9 # 3 shows nucleotide and amino acid sequences 

of CDR3a regions of tumour-specific TCR 
which have been determined by in situ 
sequencing of T cells of patient 22, 
Fi 9- 4 shows nucleotide and amino acid sequences 

of CDR3p regions of tumour-specific TCR 
which have been determined by in situ 
sequencing of T cells of patient 22, 
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Example 1 

Analysis of T cell receptors in HLA-A2 patients with 
kidney cell carcinoma 

Cytotoxic CD8+ T cells were identified in kidney cell 
patient 2 6 which lysed autologous tumour cells with a 
HLA-A2 restricted mechanism. The T cells have a high 
tumour specificity since short-term cultures of normal 
kidney cells are not recognized. The determinants 
recognized by the TIL of patient 26 were also found on 
other tumours of patients which carry the HLA-A2 gene in 
particular the widespread HLA-A*0201 allele. Normal 
kidney cells of these patients were not lysed. These 
results show that the kidney carcinoma cells of patient 
2 6 express a tumour determinant i.e. a tumour-associated 
peptide/HLA-A2 complex which is also present on the 
tumours of other patients. 

Total RNA is isolated from T cells in order to identify 
and sequence tumour-specific TCR. For this the cells in 
suspension are washed with PBS and the cell pellet is 
resuspended with 0.2 ml RNazol-B per 1 x 10 6 cells. 2 ml 
RNazol-B per 100 mg tissue is added to extract the RNA 
from the tissue. After mechanically resuspending the 
lysates several times and optionally adding yeast tRNA 
as a carrier matrix, the RNA is extracted by adding 0.2 
ml chloroform per 2 ml homogenate, subsequently mixing 
for 15 sec. and storing for 5 minutes on ice. 

After a centrif ugation step at 12,000 g for 15 min at 
4°C, the aqueous phase is removed and transferred into a 
new reaction vessel. The first precipitation of the RNA 
is carried out by adding an identical volume of 
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isopropanol and subsequently storing for at least 15 min 
at 4°C. After centrifuging for 15 min at 12,000 g and 
4°C, the RNA is obtained as a white pellet at the bottom 
of the vessel. 

After discarding the supernatant, the RNA pellet is 
purified of salts by briefly mixing in 75 % ethanol. 
After centrifuging (7500 g, 4°C, 8 min), the pellet is 
dissolved in 175 /xl water treated with diethylpyro- 
carbonate (DEPC) and precipitated again with 500 /xl 
ethanol and 75 /xl 2 M NaCl for at least 1 h at -20 °C. 
The centrifugation and washing steps after the second 
precipitation are carried out as described for the first 
precipitation. After drying the pellets in air, the RNA 
is resuspended in H 2 0-DEPC or 0.5 % SDS, pH 6.5 to 7.0 
or 1 mM EDTA, pH 7.0. 

Subsequently cDNA is synthesized from the RNA by reverse 
transcription. For this 3 /xg total RNA is incubated for 
10 minutes at 55 °C with 30 ng P-caST (a specific primer 
for the TCRa chain with the sequence 5'-CAC TGA AGA TCC 
ATC ATC TG-3- shown in SEQ ID NO. 21) and 30 ng P-C0ST 
(a specific primer for the TCR0 chain with the sequence 
5 * —TAG AGG ATG GTG GCA GAC AG-3 ' shown in SEQ ID NO. 22) 
in a reaction volume of 10 /xl. Afterwards 38 /xl RAV-2-RT 
buffer (100 mM Tris-HCl pH 8.3; 14 0 mM KC1; 10 mM MgCl 2 ; 
2 mM dithiothreitol, 0.1 mM of each dNTP) , l /xl (0.75 U) 
rRNasin and 1 /xl (is u) reverse transcriptase are added 
by pipette. The reverse transcription is carried out for 
1 h at 42 °C, followed by a denaturation step at 68 °C for 
5 min. It is stored until use at -20°c. 

Subsequently a polymerase chain reaction is carried out. 
The primer can be biotinylated in order to enable the 
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PCR products to be subsequently purified by coupling to 

The pcr is carried out using a thermostable DMA 
polymerase and the following reaction scheme: 

H'-l l"tl: nL ed r atUrati ° n ( ° nly " the "eginning, 
as c 30 sec DNA denaturation 

56 °c 1 min annealing 
72 °c l min extension 

relctJon" 1 "," 111 " 9 UP Slngle St "" ds *» ^e 
reaction solution (only at the end) . 

The number of reaction cycles in the PCR is usually 30. 

The PCR fragments obtained in this manner are sequenced. 

When the cytotoxic T cells from patient 26 are cultured 
and periodically restimulated over a period „, ™ ltUred 
aays respectively, a uniform ^V^' £ ^ " 

T^T- ^ nUCle ° tlde «" «" -^enci of the 

xcRa chain of this t p-n _ n lie 

oH . S T Cel1 clone from patient 26 are 

shown in SEQ ID no i »t->^ o m u 

acid „ 1 and 2. The nucleotide and amino 

acid sequence of the tpdR • 

3 and 4. P " Sh ° Wn in SE Q » NO. 

r: r o b homogeneous but - rAi:; ur ;r s ~ 

cent, T° leS - ° DRa reglOTS ° £ the - T cell specks 

rr:: ; r^L'iirn se ~ <seo — - 

<*na b; m addxtion to the amino acid 



- 23 - 



sequence shown in SEQ id no. 2. m addition to the amino 
acid sequence shown in SEQ id NO. 4, the CDR30 regions 
contained further closely related sequences (SEQ ID NO 

Vrjlo) 1 ; ^ 13 — - » - » and 

Furthermore the T cells of patient 26 were sequenced in 
sxtu i.e. sequenced without prior culturing. m this 
process a series of individual sequences was obtained 
for^the CDR3a region which are shown in Fig. lm circa 

< of all sequences of the a chain correspond to the 
sequences previously described. A further 20 % 
correspond to very similar sequences. 

Also in the case of the CDR3 regions of the p chain it 
was found that a total of 7n % n -r 

tai ot 70 « °f the examined T cells 
of patient 26 had a very similar sequence pattern 
(Fig. 2). 

Peripheral blood samples from patient 26 W ere analyzed 
for T oen receptors which have features of tumour- 
specific T ceil receptors over 4 years in all. It was 
found that such sequences only occurred with a frequency 
of about 1/150,000 T cells. 

Cytotoxicity investigations showed that the tumour- 
specific T cells isolated fro. patient 26 could also 
lyse tumour cells of patient 22 which also carry the 
H«- A *o 20 l allele. Tumour infiltrating T cells from 
patent 22 could in turn lyse tumour cells from patient 

viellrtr n ° ln9 ° f T *«- Patient 22 

and . " SUltS Sh ° m ln Fig " 3 f ° r th « CDR3a region 
and m Fig. 4 for the CDR3p region. 
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Examp] e ? 

Expression of T cell receptors 

2.1 Expression of tumour specific T cell receptors 
human or murine T cell lines 



in 



The nucleic acid sequences identified in example 1 
which code for tumour specific TCRa and J3 chains 
are cloned into eukaryotic human and murine 
expression vectors. The human expression vector is 
described in Chung et al. (Proc. Natl. Acad. Sci 
USA 92 (1995): 3712-3716). The murine vectors are 
described in Gabert et al. (cell 50 (1987: 545-554) 
and Gregoire et al. (Proc. Natl. Acad. Sci. USA 88 
(1991): 8077-8081). 

The TCR DNA can either be cloned from rearranged 
genomic DNA or from cDNA. Basically two cloning 
strategies are available: firstly the isolation of 
very long TCRa and 0 DNA fragments from the genome 
of mature T cells which contain several Kb long 5- 
flanking sequences with all regulatory elements 
required for expression. Alternatively vectors can 
be selected which already contain the natural 5 • 
regulatory elements and in which only short 
fragments coding for the variable regions have to 
be cloned in (Kouskoff et al. j. Immunol. Methods 
180 (1995): 273-280). m the latter method the 
sequence of the variable region (including the 
leader sequence) is examined for mistakes by 
sequencing after amplification by means of specific 
PCR and subsequently introduced into the vector 
after digestion with appropriate restriction 
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2.2 



endonucleases . 

The PCRct and 0 chains can either be cloned into a 
common vector or into two different vectors. Each 
of the vectors used contains a selection marker 
which enables the positive selection of 
successfully transfected cells after transfection 
of the recipient cells with the recombinant 
plasmid. Preferred selection markers are for 
example the gene for neomycin resistance (neo) or 
the gene for xanthine-guanine-phosphoribosyl- 
transferase (GPT) . 

Expression of functional T cell receptors as single 
chain constructs 

Similarly to antibodies it is possible to express 
TCR as single chain constructs in eukaryotic cells 
(Chung et al., Proc. Natl. Acad. Sci. USA 91 
(1994): 12654-12658). In this method a construct is 
prepared which also contains the constant domain of 
the 0 chain in addition to the variable domains of 
the TCRa and 0 chain. The individual domains are 
amplified by means of PCR as described in example 1 
after isolation of the corresponding RNA and 
reverse transcription. In this process suitable 
restriction cleavage sites are inserted at the ends 
of the amplification products. The individual 
fragments are then ligated together as follows in a 
eukaryotic expression vector (e.g. pBJ-Neo) which 
carries a positive selection marker: the variable 
TCRa and 0 domains comprising leader, V-(D)- and J 
exon are separated by a linker sequence e.g. a DNA 
fragment coding for the amino acid sequence 



- 26 - 



2.3 



(GGGGS) 3 . The exon for the constant TCR0 domain is 
ligated directly to the variable 0 domain. 

Alternatively coding sequences for a GPI anchor 
(Lin et al., Science 249 (1990): 677-679) or for 
example the transmembrane part and the 
intracellular domain of the CD3Q chain (Engel et 
al., Science 256 (1992): 1318-1321) can be ligated 
to the 3- end of this construct. After transection 
of these constructs in eukaryotic cells, the former 
enables the production of soluble TCR molecules 
which can be used as an immunogen to produce 
antibodies. The latter enables the functional 
analysis of the construct in biological systems. 

Production of soluble human TCR fragments in E. coli 

Large amounts of soluble TCR fragments can be 
produced in E. coli as single chain polypeptides 
(Hilyard et al., Proc. Natl. Acad. Sci. USA 91 
(1994): 9057-9061). 

For this various genes or gene fragments are cloned 
into an inducible prokaryotic vector e.g. P UC19 . 
The fragments to be ligated are reamplified by 
means of specific PCR in the process of which 
suitable restriction cleavage sites are added. 

The following fragments are cloned into the vector 
in the order shown: 



1. 



A prokaryotic signal sequence e.g. the pelB- 
leader sequence from the pectate lyase gene of 
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Erwinia carolovora (Ward et al., Nature 341 
(1989) : 8646-8650) which causes a secretion of 
the polypeptide into the periplasm of the host 
bacterium. 

2. The variable PCRa and p chain fragments from a 
tumour-specific TCR. These fragments are 
preferably separated by a linker e.g. the 
linker shown in example 2.2. which improves the 
solubility and the flexibility of the 
synthesized molecule. 

3. A nucleotide sequence coding for a tail made of 
several e.g. 6 histidine residues which enables 
the recombinant polypeptide to be isolated by 
affinity chromatography e.g. by nickel chelate 
chromatography . 

Example 3 

Production of antibodies against tumour-specific T cell 
receptors 

Mice are immunized with the appropriate antigen to 
produce antisera or monoclonal antibodies against 
tumour-specific TCR. The immunization is carried out 
according to the protocols given by Harlow, E. and 
David, C, Antibodies. A Laboratory Manual, Cold Spring 
Harbor Laboratory, 1988. TCR expressing cells (example 
2.1) or soluble TCR (example 2 . 2 or example 2.3) can for 
example be selected as antigens. 



Alternatively the soluble TCR used for the immunizati 
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can also be produced as chimeric proteins which are 
composed of a variable TCR region, a truncated constant 
TCR region and a constant immunoglobulin region (cf . 
e.g. Gregoire et al. (1991), Supra). For this the 
specific variable TCRcc and p regions are each cloned 
into a plasmid which already contains the first exon, a 
corresponding c region and an IgGk domain. Both plasmids 
additionally contain a positive selection marker and the 
regulatory elements required for correct expression. 
Both plasmids are then used to transfect a mouse myeloma 
cell line which does not express endogenous heavy and 
light Ig chains. After the transfection is completed 
both chimeric chains are synthesized and preferentially 
secreted as heterodimers . 

Alternatively a TCR protein antigen for immunizing mice 
can be constructed as follows: A human V gene segment is 
fused to a TCR gene segment composed of (D) , J and C 
gene segments from a mouse T cell hybridoma i.e. the 
gene segments are cloned in this order into a eukaryotic 
expression vector (Choi et al., Proc. Natl. Acad. Sci. 
USA 88 (1991): 8357-8361). The human sequence is 
obtained from the corresponding cDNA by means of PCR by 
amplifying the V region. Such constructs are then used 
to transfect mouse T cell hybridomas which provide all 
components apart from the corresponding transfected 
chains. Since the plasmids also code for selection 
markers, transf ectants can be positively selected by an 
appropriate medium. Since these transf ectants represent 
mouse T cells which express a human V region, mice that 
are immunized with such cells only produce antibodies 
against this foreign human sequence. 
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Example 4 



Identification of the peptide ligands of tumour specific 
T cells 



Poly-A+ ntRNA is isolated from a kidney cell carcinoma 
line using a commercial kit (Fastrack/Invitrogen) and 
converted into double-stranded cDNA using the 
Superscript Choice System kit (Gibco) using a 
Notl/Oligo-dT primer for the first strand synthesis. The 
cDNA is ligated with BstXI adaptors and cleaved with 
Notl. High molecular size fractionated cDNA is selected 
and cloned into the vector pcDNAl/Amp (Invitrogen) 
cleaved with BstXI and Notl. 

E. coli DH5<x cells are transformed by electroporation 
with the recombinant plasmids and selected with 
ampicillin. The cDNA bank obtained in this manner is 
divided into 1500 pools each comprising approximately 
100 clones. Each pool is amplified to saturation and the 
plasmid DNA is isolated from this by alkaline lysis 
without phenol extraction. 

In each case approximately 100 ng plasmid DNA of a pool 
is transfected together with 50 ng plasmid DNA of the 
same vector which carries the HLA-A*0201-cDNA (gene 
bank, ACC No.: M32322, K02883, M84379, X02457) into 
15000 COS7 cells according to the DEAE-dextran- 
chloroquine method. Alternatively the C0S7 cells can 
also be transfected with the HLA-A*020l DNA and the 
stable transfectants obtained in this manner can be used 
as recipient cells. 

24-48 hours after transfection the COS7 cells are tested 
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for their ability to stimulate the release of TNF by 
tumour specific cytotoxic T cells (CTL) . A test is 
carried out in each case with 200 pools i.e. 200 
independent transf ections of C0S7 cells. 

For this 3 000 CTL are added to the wells of microtitre 
plates containing COS7 transf ectants. After 18 hours the 
supernatant of the medium is collected and its TNF 
content is determined using an activity test in which 
TNF sensitive cell lines such as the mouse fibroblast 
cell lines WEHI 164 or L929 are lysed by TNF. Viable 
cultures can be distinguished from lysed cells by a 
colorimetric test using 3- (4 , 5-dimethylthiazol-2-yl) 2 , 5- 
diphenyltetrazolium bromide (MTT) . 

A new cycle of C0S7 transfection was carried out for 
each positive microculture in which smaller pools of 
bacteria from the original pool containing a total of 
100 clones were used in each case. This procedure is 
repeated until a single plasmid is identified which can 
xnduce the TNF release from the specific TCL after co- 
expression with HLA-A*02 01 cDNA in C0S7 cells. 

The sequence of the plasmid insertion is determined by 
standard methods. The confirmation that this sequence 
codes for the tumour peptide is achieved by transfecting 
normal human HLA- A *020l cells which are not lysed by the 
tumour specific CTL. These cells are sensitive for a 
lysxs after transfection with the corresponding cDNA. 
Furthermore the tumour specific expression of the 
identified cDNA is determined by Northern blot using the 
CDNA as a probe. This probe is used for hybridization to 
mRNA from various tumour cell lines of normal tissue 
samples. 



- 31 



The tumour specific peptide can be identified by various 
methods. The corresponding protein sequence is derived 
from the cDNA sequence and screened for binding motifs 
which had been identified, in other HLA-A*0201 binding 
peptides. Synthetic peptides which overlap with 
potential HLA-a*0201 binding regions are then tested for 
their ability to activate CTL after incubation with HLA- 
A 0201 cells. Alternatively overlapping peptides of 8-9 
ammo acids in length can be produced by synthesis and 
tested in a similar manner. 

Example s 

Production of transgenic mice 

Total RNA is isolated from a specific T cell clone and 
CDNA is synthesized by reverse transcription (cf . 
example 1) . using primers specific for the V region 
TCR-cDNA for the va and v3 regions is amplified and' 
cloned into TCR gene cassettes which contain constant 
regions and the necessary regulation elements for 
expression. Separate cassettes for TCRa and TCR0 
sequences are known which each carry a different 
selection marker (Kouskoff et al., (1995), Supra). 

Fertilized mouse oocytes are simultaneously 
microinjected with DNA from the TCRa as well as from the 
TCR0 cassettes. The injected oocytes are transferred 
back into female mice (Mellor, A.L., Transgenesis and 
the T cell receptor, in: T cell receptors (1995), J. i 
Bell, M . J. Owen and E. Simpson, eds. pp 194-223,' Oxford 
University Press, Oxford, New York, Tokyo). 

The introduction of productively rearranged TCR genes in 
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the mouse has a major influence on the TCR repertoire 
since rearranged TCR foreign genes prevent the further 
rearrangement of endogenous TCR genes. Consequently 
nearly all thymocytes and T cells express the 
heterologous TCR clonotype so that the TCR repertoire in 
such mice is essentially monoclonal. 

Transgenic mice are identified by genotype analysis 
using probes which are specific for the DNA contained in 
the foreign gene that does not occur in the mouse 
genome. This can either be carried out by Southern blot 
hybridization or preferably by PCR. 



Transgenic descendants of the mice are obtained by 
crossing with non-transgenic mice of a suitable strai 
typing the descendants and using them for further 
crossing. 
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Claims 



1. Nucleic acid which codes for the a chain of a human 
T cell receptor, or for a functional derivative or 
a fragment thereof and which comprises a CDR3 
region formed from a combination of a Va20 and Ja22 
gene segment. 



2. 



Nucleic acid which codes for the a chain of a human 
T cell receptor, or for a functional derivative or 
a fragment thereof and comprises a CDR3 region 
selected from: 



(a) a nucleotide sequence coding for the amino 
acid sequence 



Y C L (X x . . .X n ) S A R Q L T F 



(I) 



in which Xi ... x n represents a sequence of 
3-5 amino acids, 

(b) a nucleotide sequence which codes for an amino 
acid sequence which is at least 80 % identical 
with the amino acid sequence from (a) , or 

(c) a nucleotide sequence which codes for an amino 
acid sequence with an equivalent recognition 
specificity for the peptide component of the 

T cell receptor ligands. 
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3. Nucleic acid as claimed in claim 2, 
wherein 

the amino acid sequence X x . . . x n is selected from 
the group comprising the amino acid sequences VGG, 
VLSG, ATG, VSG, DSG, WSG, ALAG, APSG and VGR. 

4. Nucleic acid as claimed in claim 3, 
wherein 

the amino acid sequence X ± . . . x n is selected from 
the group comprising amino acid sequences VGG, VLSG 
and ATG. 

5. Vector, 
wherein 

it contains at least one copy of a nucleic acid as 
claimed in one of the claims 1 to 4 . 

6. Cell, 
wherein 

it expresses a nucleic acid- as claimed in one of 
the claims 1 to 4 . 

7. Cell, 
wherein 

it is transformed with a nucleic acid as claimed in 
one of the claims 1 to 4 or with a vector as 
claimed in claim 5. 

8. Polypeptide, 
wherein 

it is coded by a nucleic acid as claimed in one of 
the claims 1 to 4 . 
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3. Polypeptide as claimed in claim 8, 
wherein 

it comprises the variable domain of the a chain of 
a human T cell receptor. 

.0. Nucleic acid which codes for the p chain of a human 
T cell receptor, or for a functional derivative or 
a fragment thereof and comprises a CDR3 region 
formed from a combination of a Vp22 gene segment, a 
DPI or Dp2 gene segment and a jp gene segment in 
particular a JP2.1, Jp2.3 or Jp2 . 7 gene segment. 

1. Nucleic acid which codes for the P chain of a human 
T cell receptor, or for a functional derivative or 
a fragment thereof and comprises a CDR3 region 
which is selected from: 

(a) a nucleotide sequence coding for the amino acid 
sequence 

C A (X'i ... X' n ) Y/D E Q Y F (II) 

in which X' ± ... X' n represents a sequence of 
5-7 amino acids, 

(b) a nucleotide sequence coding for the amino acid 
sequence 

C A (X' " 2 ... X" n ) N E Q F F (III) 

in which x» » x ... x' ' n represents a sequence of 
5-7 amino acids, 
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(c) a nucleotide sequence coding for the amino acid 
sequence 

C A (X 111 ! ... X ,!, n ) D T Q Y F (IV) 

in which X 11 ^ ... X ■ 1 * n represents a sequence 
of 5-7 amino acids, 

(d) a nucleotide sequence which codes for an amino 
acid sequence that is at least 80 % identical 
with an amino acid sequence from (a) , (b) 
or/and (c) , or 

(e) a nucleotide sequence which codes for an amino 
acid sequence with an equivalent recognition 
specificity for the peptide component of the 

T cell receptor ligand. 

12. Nucleic acid as claimed in claim 11, 
wherein 

the amino acid sequence X* ± ... X' n is selected 
from the group comprising SSETNS, SSETSS, TSGTAS, 
RSGTGS, SSGTDS, SSGTRS, SSGSDS , SSSTGS, SSSTVS, 
SSSTLS, SSSTLF, SSSTAS, SSHTDS, SSDTLS and SRWDSE. 

13. Nucleic acid as claimed in claim 12, 
wherein 

the amino acid sequence X 1 ^ ... X' ' n represents 
SSETNS, SSGTDS, TSGTAS or RSGTGS. 
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14. Nucleic acid as claimed in claim 11, 
wherein 

the amino acid sequence X'^ . .. X' ' n represents 
SSGTSSY or SSDQGM or the amino acid sequence 
X 111 ! X !ll n represents SADSFK. 

15. Vector, 
wherein 

it contains at least one copy of a nucleic acid as 
claimed in one of the claims 10 to 14. 

16. Cell, 
wherein 

it expresses a nucleic acid as claimed in one of 
the claims 10 to 14, 

17. Cell, 
wherein 

it is transformed with a nucleic acid as claimed in 
one of the claims 10 to 14 or with a vector as 
claimed in claim 15. 

18. Polypeptide, 
wherein 

it codes for a nucleic acid as claimed in one of 
the claims 10 to 14. 

19. Polypeptide as claimed in claim 18, 
wherein 

it comprises the variable domain of the p chain of 
a human T cell receptor. 
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20. Polypeptide, 
wherein 

it has T cell receptor properties and is composed 
of a polypeptide as claimed in claim 8 or 9 as well 
as a polypeptide as claimed in claim 18 or 19 as 
subunits. 

21. Polypeptide as claimed in one of the claims 8, 9, 
18, 19 or 20, 

wherein 

it is coupled to a labelling group or a toxin. 

22. Polypeptide as claimed in one of the claims 8, 9, 
18, 19, 20 or 21, 

wherein 

it is present in an oligomer ized form. 

23. Antibody against a polypeptide as claimed in one of 
the claims 8, 9, 18, 19, 20, 21 or 22 which is 
directed against a region which is responsible for 
recognizing the peptide ligand. 

24. Antibody as claimed in claim 23, 
wherein 

it is directed towards a CDR3 region. 

25. T cell, 
wherein 

it contains a T cell receptor as claimed in claim 
20. 
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26. Pharmaceutical composition which contains as active 
component a nucleic acid as claimed in one of the 
claims 1 to 4 or 10 to 14, a polypeptide as claimed 
in one of the claims 8, 9 or 18 to 23, a peptide 
ligand against the polypeptide, an antibody as 
claimed in claim 23 or 24 or a cell as claimed in 
claim 6, 7, 16, 17 or 2 5 optionally together with 
other active components as well as common 
pharmaceutical auxiliary agents, additives or 
carrier substances . 

27. Use of a pharmaceutical composition as claimed in 
claim 2 6 for the production of an agent for the 
diagnosis of tumour diseases or a predisposition 
for a tumour disease. 

28. Use of a pharmaceutical composition as claimed in 
claim 2 6 for the production of an agent for 
monitoring the course of the disease in a tumour 
disease. 

29. Use as claimed in claim 27 or 28, 
wherein 

the detection of T cells that express a polypeptide 
as claimed in claim 2 0 as the T cell receptor is 
carried out in a sample liquid by a nucleic acid 
hybridization assay, an immunoassay, a test for the 
binding of specific peptide ligands or a specific T 
cell activity test. 

30. Use of a pharmaceutical composition as claimed in 
claim 2 6 for the production of an agent for the 
prevention or therapy of a tumour disease. 
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31. Use as claimed in claim 30, 
wherein 

the agent is suitable for the stimulation of the 
growth of T cells that express a polypeptide as 
claimed in claim 2 0 as a T cell receptor, 

32. Use as claimed in claim 31, 
wherein 

the agent is suitable for growth stimulation of the 
T cells in vivo. 

33. Use as claimed in claim 31 or 32, 
wherein 

the agent for growth stimulation comprises the 
peptide ligand of the T cell receptor or/and the 
entire molecule from which the peptide ligand is 
derived or a fragment thereof. 

34. Use as claimed in claim 31 or 32, 
wherein 

the growth stimulation includes an antibody that 
specifically activates the T cell receptor. 

35. Use as claimed in claim 31, 
wherein 

the growth stimulation is carried out by isolating 
specific T cells, in vitro expansion and subsequent 
administration of expanded T cells. 

36. Use as claimed in one of the claims 27 to 35, 
wherein 

the tumour disease is a kidney cell carcinoma. 
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37. Process for the isolation of T cells that express a 
polypeptide as claimed in claim 20 as a T cell 
receptor, 

wherein 

a sample containing T cells is contacted with an 
agent that binds specifically to the CDR3 region of 
the T cell receptor, T cells that react with the 
agent are identified and optionally separated from 
other T cells. 

38. Process as claimed in claim 37, 
wherein 

the agent is selected from the peptide ligahd of T 
cells, a MHC peptide complex containing the peptide 
ligand or /and an anti-TCR antibody. 

39. Process as claimed in claim 37 or 38 additionally 
comprising an in vitro expansion of T cells. 

40. Process for the isolation of T cells which express 
a polypeptide as claimed in claim 2 0 as the T cell 
receptor , 

wherein 

nucleic acid sequences that code for the T cell 
receptor are introduced into a T cell line and are 
made to express therein. 

41. Process for the isolation of T cells that express a 
polypeptide as claimed in claim 20 as the T cell 
receptor , 

wherein 

nucleic acid sequences which code for the T cell 
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receptor are introduced into the germ line of an 
animal and the T cells are isolated from the 
resulting transgenic animal or descendants thereof. 

42. Transgenic animal, 
wherein 

it expresses a polypeptide as claimed in claim 2 0 
as the T cell receptor. 

43. Method for the identification of peptide ligands of 
a T cell receptor as claimed in claim 2 0 comprising 
the steps : 

(a) isolating RNA from tumour tissue, 

(b) converting the RNA into double-stranded cDNA 
molecules, 

(c) introducing the cDNA molecules into host cells 
to obtain a cDNA bank, 

(d) transfecting eukaryotic recipient cells with 
aliguots of the cDNA bank wherein (i) 
cotransf ection with HLA-A*0201 DNA is carried 
out or (ii) HLA-A*0201 DNA positive recipient 
cells are used, 

(e) testing the transfected recipient cells for 
their ability to secrete cytokines, 

(f) identifying a cDNA sequence which codes for 
the antigen which contains the peptide ligand 
and 
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(g) identifying the sequence of the peptide 
ligand. 

44, Method as claimed in claim 43, 
wherein 

step (e) comprises testing for the ability to lyse 
TNF-sensitive cells . 
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Abstract 



The present invention concerns new nucleic acid and 
amino acid sequences of the human T cell receptor and 
their use for the diagnosis and therapy of carcinomas in 
particular of kidney cell carcinomas. 



Fig. 1/1 



Results 



#2 6 tumour i . s , 



CDR3a-Region 



Fraament 



TCFAV20S1 



N-Region 



TCRAJ 



Clones {14/54} 



C L V G 
TGCCTCGTGGG TG 

Va.20 or Ja22 coded 



GSARQLTF 
GTTC T G CAA G G CAA CT GA C C TT 
TCRAJ22 



Clones (i/54) 



c . L v g 
TGCCTCGTGGG 



GSARQLTF 
GGTTCTGCAAGGCAACTGACCTT 
TCRAJ22 



Clones (H/54) 



c l v 

TGCCTCGT 



L 

CCT 



SGSARQLTF 
TTCTGGTTCTGCAAGGCAACTGACCTT 

TCRAJ22 



Clones: (5/54) 



C L V 

TGCCTCGTG 



L 

CT 



SGSARQLTF 
TTCTGGTTCTGCAAGGCAACTGACCTT 

TCPAJ22 



Clones (2/54) 



C L 
TGCCTCG 



A 

CTA 



TGSARQL TF 
CTGGTTCTTGCAAGGCAACTGACCTT 

TC?A J2 2 



Clones (1/54) 



C L 

TGCCTCG 



V 
T 



SGSARQ LTF 
TTCT GGTTCTGCAAGGCAACTGACCTT 

TCR^J22 



Clones (1/54) 



C L V 
TGCCTCGTGG 



V S 
TCTCC 



GSARQLTF 
GGTTCTGCAAGGCAACTGACCTT 
TCPAJ22 



Clones! d/54) 



C X 
TGCCTCG 



D S 
ACTCC 



GSARQLTF 
GGTTCTGCAAGGCAACTGACCTT 
TCRAJ22 



Clones (1/54) 



C L 
TGCCTCG 



D 

AC 



SGSARQLTF 
TCTGGTTCTGCAAGGCAACTGACCTT 

TCRAJ22 



Clones (2/54) 



c L 

TGCCTCG 



ALA 
CCCTGGGG 



GSARQLTF 
GGTTCTGCAAGGCAACTGACCTT 
7CRAJ22 
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Clones ' 



Clones (1/54) 



Clones (2/54) 



C -L 
TGCCTCG 



ALA 
CCCTGGCG 



C L 
TGCCTCG 



A P 
CGCCC 



GSARQLTF 
GGTTCTGCAAGGCAACTGACC7T 
TCRAJ22 



sgsarq'ltf 

TCTGGTTCTGCAAGGCAACTGACCTT 

TCRAJ22 



C L 



- rr „ ^SGSARQLTF 
TGCCT TC CTTCTGGTTCTGCAAGGCAACTGACCTT 



TCRAJ22 
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Results 



#2 6 tumour i.s, 



CDR3/5-Region 



:a ament 



TCR3V22S1 



N-TCRBD-N 



TCRBJ 



Clones (8/62) 



CASS 
TGTGCCAGCAG 



E T K 
CGAAACTAA 
TCRBD2 



s y E Q y r 
CTCCTACGAGCAGTACTT 
TCR3J2S7 



Clones (2/62) 



Clone n 



(1/62) 



CASS 
TGTGCCAGCAGT 



CASS 
TGTGCCAGCAGT 



E T N 

GA AACTAA T 
TCRBD2 



E T S 
GAAACTT CT 
TCRBD2 



s y e q y r 

TCCTACGAGCAGTACTT 
TCRBJ2S7 



s y E q y F 
TCCTACGAGCAGTACTT 
TCRBJ2S7 



Clone d/62) 



CASS 
TGTGCCAGCAGT 



E T S S Y E Q y r 

GAAACAAG CTCCTACGAGCAGTACTT 
TCRBD1 TCR3J2S7 



Clones (2/62) 



c A 
TGTGCCA 



T S G T A 
CCTCC GGGACAG CT 
TCRBD1 



s y E q y r 

TCCTACGAGCAGTACTT 
TCR5J2S7 



Clones (2/62) c A R S G T G 

TGTGCCAG ATCCGGGACAGG CTCCTACGAGCAGTACTT 

TCR3D1 TCRBJ257 



s y e q 



Clones (2/62) cass G t d 

TGTGCCAGCAGT GGGAC GGA CTCCTACGAGCAGTACTT 

• -. TCR3D1/2 TCR5J2S7 



s y e q y 



Clone a/62) cass gtd s y e o y . 

TGTGCCAGCAGT GGCACAGA CTCCTACGAGCAG^ctt 

TCR3D1 TCRBJ2S7 



Clone (1/62) 



CASS 
TGTGCCAGCAG 



GTD 

CGGGACAGAT 
TCR3D1 



s y e q y f 

TCCTACGAGCAGTACTT 
TCRBJ2S7 
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Clone (1/62) 



CASS 
TGTGCCAGCAGT 



G T R 

GGGACT CGT 
TCRBD2 



. S Y E Q Y f 
TCCTACGAGCAGTACTT 
TCRBJ2S7 



Clone (1/62) 



CASS 
TGTGCCAGCAGT 



G T R 
GGGACACGT 
TCRBD1 



S Y E Q y r 

TCCTACGAGCAGTACTT 
TCRBJ2S7 



Clones (2/62) 



CASS 
TGTGCCAGCAGT 



G T S S 
GGAACTAGCTCTT 
TCR3D2 



Y N E Q F F 
ACAATGAGCAGTTCTT 
TCRBJ2S1 



(TCRBJ2S7 sehr ahnlich) 



Clone d/62) 



Clones (5/62) 



CASS 
TGTGCCAGCAGT 



CASS 
TGTGCCAGCAGT 



GSD SYEQYF 
GGGTCCGA CTCCTACGAGCAGTACTT 
TCR3D1/2 TCRBJ2S7 



S T G 

TCGACAGGG 
TCR3D1 



SYEQYF 
TCCTACGAGCAGTACTT 
TCR5J2S7 



Clones (2/62) 



CASS 
TGTGCCAGCAG 



S T V 

CTCGACGGT 
TCRBD1/2 



SYEQYF 
CTCCTACGAGCAGTACTT 
TCR3J2S7 



Clone a/62) 



CASS 
TGTGCCAGCAGT 



STL 
TCAACATTA 
TCR3D2 



SYEQYF 
TCCTACGAGCAGTACTT 
TCR3J2S7 



Clone d/62) 



CASS 
TGTGCCAGCAGT 



S T L F 

TCAACATTATT 
TCRBD2 



Y E Q Y F 
CTACGAGCAGTACTT 
TCR3J2S7 



Clone (1/62) 



CASS 
TGTGCCAGCAGT 



S T A 

TC GACAG C. 
TCR5D1 



SYEQYF 
CTCCTACGAGCAGTACTT 
TCR3J2S7 



Clones (2/62) 



CASS 
TGTGCCAGCAG 



H T D 
CCACACCGA 
TCR3D1 



SYEQYF 
CTCCTACGAGCAGTACTT 
TCRBJ2S7 



Clones . (2/62) 



CASS 
TGTGCCAGCAGT 



D T L 
GACACCCT 
TCR5D1 



SYEQYF 
CTCCTACGAGCAGTACTT 
TCR5J2S7 
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Results #22 tumour i. s . CDR3a-Region 

Clones (13/34) y c x, v g 

Val6 or JaC coded tcraj22 



TACTGCCTCGTGGG TG ' ' otctg^GC^CTGaJctJ 



Clones (2/34) 



V C L V G 
TACTGCCTCGTGGG 



GSARQLTF 
GGTTCTGCAAGGCAACTGACCTT 
TCRAJ22 



Clone '{1734 ) 



Y C L V G 
TACTGCCTCGTGGG 



TC 



RSARQLTF 
GTTCTGCAAGGCAACTGACCTT 
TCRAJ22 



Clones (4/34) 



V C L V 
TACTGCCTCGT 



L 

CCT 



SGS ARQLTF 
TTCTGGTTCTGCAAGGCAACTGACCTT 

TCRAJ22 



ClortPQ (2/34) Y C L 

TACTGCCTCG 



A 

CTA 



TGSARQLTF 
CTGGTTCTGCAAGGCAACTGACC7T 

TCR^J22 



Results , #22 tumour i.s. 



CDR3/3-Region 



Fig. 4 



Clones (10/28) 



CAS 
TGTGCCAG 



A D S F K 
TG CCGATT CTTTTAA 
TCRBD2 



D T Q y F 
AGATACGCAGTATTT 
TCRBJ2S3 



Clones (4/28) 



CASS 
TGTGCCAGCAG 



E T N 

CGAAACTAA 
TCRBD2 



S Y E Q Y f 
CTCCTACGAGCAGTACTT 
Jb2. 7 



Clones (1/28) 



CASS 
TGTGCCAGCAGT 



D Q G M 
GATCAGGGGATG 
TCRSD2 



N E Q F F 

AATGAGCAGTTC7T 
TCRBJ2S1 



Clones {1/28} 



C A S R 

TGTGCCAGCAG 



W D S E 

GTGGGACTCCGAGG 
TCRBD2 



D E Q y f 
ACGAGCAGTACTT 
TCRBJ2S7 



SEQUENCE PROTOCOL 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Boehringer Mannheim GmbH 

(B) ROAD: Sandhofer Str. 112-132 

(C) CITY: Mannheim 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 683 05 

(ii) TITLE OF INVENTION: T cells specific for kidney 

carcinoma 

(iii) NUMBER OF SEQUENCES : 22 

(iv) COMPUTER-READABLE FORM: 

(A) DATA CARRIER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #i.o, version 

# 1.30 (EPA) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 41 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..801 

(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) POSITION: 1..54 

(ix) FEATURE: 

(A) NAME/ KEY: mat peptide 

(B) POSITION: 55.7801 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



Me? Ara Gin vlf ?f 2 2*? i? C GTG TTC CTG ACC CTG AGT ACT *TG 

Met Arg Gin Val Ala Arg Val He Val Phe Leu Thr Leu Ser Thr Leu 

ia ~ 15 -10 - 5 

AGC CTT GCT AAG ACC ACC CAG CCC ATC TCC ATG GAC TCA TAT GAA GGA 
Ser Leu Ala Lys Thr Thr Gin Pro He Ser Met Asp Ser Tyr Glu lly 
1 5 10 

GUi S?? AA ° A T A ACC TGT AGC CAC AAC ^ ATT GCT ACA AAT GAT 
Gin Glu Val Asn He Thr Cys Ser His Asn Asn He Ala Thr Asn Asp 

15 20 25 3 £ 

Itl tTf £ CG l GG l AC CAA CAG TTT CCC AGC . CAA GGA CCA CGA TTT ATT 
Tyr He Thr Trp Tyr Gin Gin Phe Pro Ser Gin Gly Pro Arg Phe lie 

35 40 45 



48 



96 



144 



192 



336 



384 



ATT CAA GGA TAC AAG ACA AAA GTT ACA AAC GAA GTG GCC TCC CTG TTT ? 4 n 
- He Gin Gly Tyr Lys Thr Lys Val Thr Asn Glu Val Ala Ser Leu Phe 
50 55 60 

ATC CCT GCC GAC AGA AAG TCC AGC ACT CTG AGC CTG CCC CGG GTT TCC 28R 
He Pro Ala Asp Arg Lys Ser Ser Thr Leu Ser Leu Pro Arg Val Ser 
65 70 75 

CTG AGC GAC ACT GCT GTG TAC TAC TGC CTC GTG GGT GGT TCT GCA AGG 
Leu Ser Asp Thr Ala Val Tyr Tyr Cys Leu Val Gly Gly Ser Ala Arg 
80 85 90 

CAA CTG ACC TTT GGA TCT GGG ACA CAA TTG ACT GTT TTA CCT GAT ATC 
Gin Leu Thr Phe Gly Ser Gly Thr Gin Leu Thr Val Leu Pro Asp He 
95 100 105 HO 

CAG AAC CCT GAC CCT GCC GTG TAC CAG CTG AGA GAC TCT AAA TCC AGT 432 
Gin Asn Pro Asp Pro Ala Val Tyr Gin Leu Arg Asp Ser Lys Ser Ser 

115 120 125 

GAC AAG TCT GTC TGC CTA TTC ACC GAT TTT GAT TCT CAA ACA AAT GTG 48 0 
Asp Lys Ser Val Cys Leu Phe Thr Asp Phe Asp Ser Gin Thr Asn Val 
, 130 135 140 

TCA CAA AGT AAG GAT TCT GAT GTG TAT ATC ACA GAC AAA ACT GTG CTA 
Ser Gin Ser Lys Asp Ser Asp Val Tyr He Thr Asp Lys Thr Val Leu 
145 150 155 

GAC ATG AGG TCT ATG GAC TTC AAG AGC AAC AGT GCT GTG GCC TGG AGC 
Asp Met Arg Ser Met Asp Phe Lys Ser Asn Ser Ala Val Ala Trp Ser 
160 165 170 

AAC AAA TCT GAC TTT GCA TGT GCA AAC GCC TTC AAC AAC AGC ATT ATT 62 4 

Asn Lys Ser Asp Phe Ala Cys Ala Asn Ala Phe Asn Asn Ser He lie 
175 180 185 190 

CCA GAA GAC ACC TTC TTC CCC AGC CCA GAA AGT TCC TGT GAT GTC AAG 67? 
Pro Glu Asp Thr Phe Phe Pro Ser Pro Glu Ser Ser Cys Asp Val Ly*2 

I 95 200 205 

CTG GTC GAG AAA AGC TTT GAA ACA GAT ACG AAC CTA AAC TTT CAA AAC 
L«u Val Glu Lys Ser Phe Glu Thr Asp Thr Asn Leu Asn Phe Gin Hn 
210 215 220 

CTG TCA GTG ATT GGG TTC CGA ATC CTC CTC CTG AAA GTG GCC GGG TTT 768 
Leu Ser Val He Gly Phe Arg He Leu Leu Leu Lys Val Ala Gly Phe 
225 230 235 

tel £S LeS S£2 ?° G f TG TGG TCC AG ° ^AGATCTGC AAGATTGTAA 

Asn L,eu ^eu Met Thr Leu Arg Leu Trp Ser Ser 

240 245 



528 



576 



720 



821 



GACAGCCTGT 


GCTCCCTCGC 


TCCTTCCTCT 


GCATTGCCCC 


TCTTCTCCCT 


CTCCAAACAG 


881 


AGGGAACTCT 


CCTACCCCCA 


AGGAGGTGAA 


AGCTGCTACC 


ACCTCTGTGC 


CCCCCCGGCA 


941 


ATGCCACCAA 


CTGGATCCTA 


CCCGAATTTA 


TGATTAAGAT 


TG CTG AAG AG 


CTGCCAAACA 


1001 


CTGCTGCCAC 


CCCCTCTGTT 


CCCTTATTGC 


TGCTTGTCAC 


TGCCTGACAT 


TCACGGCAGA 


1061 


GGCAAGGCTG 


CTGCAGCCTC 


CCCTGGCTGT 


GCACATTCCC 


TCCTGCTCCC 


CAGAGACTGC 


1121 


CTCCGCCATC 


CCACAGATGA 


TGGATCTTCA 


GTGGGTTCTC 


TTGGGCTCTA 


GGTCCTGGAG 


1181 



■ AATGTTGTGA GGGGTTTATT TTTTTTTAAT AGTGTTCATA AAGAAATACA TAGTATTCTT 
CTTCTCAAGA CGTGGGGGGA AATTATCTCA TTATCGAGGC CCTGCTATGC TGTGTGTCTG 
GGCGTGTTGT ATGTCCTGCT GCCGATGCCT TCATTAAAAT 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Gin Val Ala Arg Val He Val Phe Leu Thr Leu Ser Thr Leu 
~ 18 ». -15 -10 _ 5 

Ser Leu Ala Lys Thr Thr Gin Pro He Ser Met Asp Ser Tyr Glu Gly 
1 5 10 

Gin Glu Val Asn He Thr Cys Ser His Asn Asn He Ala Thr Asn Asp 
15 20 25 30 

Tyr He Thr Trp Tyr Gin Gin Phe Pro Ser Gin Gly Pro Arg Phe He 

35 40 45 

He Gin Gly Tyr Lys Thr Lys Val Thr Asn Glu Val Ala Ser Leu Phe 
50 55 60 

He Pro Ala Asp Arg Lys Ser Ser Thr Leu Ser Leu Pro Arg Val Ser 
65 70 75 

Leu ser Asp Thr Ala Val Tyr Tyr Cys Leu Val Gly Gly Ser Ala Arq 
80 85 90 

Gin Leu Thr Phe Gly Ser Gly Thr Gin Leu Thr Val Leu Pro Asp He 
95 100 105 HO 

Gin Asn Pro Asp Pro Ala Val Tyr Gin Leu Arg Asp Ser Lys Ser Ser 

115 120 125 

Asp Lys Ser Val Cys Leu Phe Thr Asp Phe Asp Ser Gin Thr Asn Val 
130 135 140 

Ser Gin Ser Lys Asp Ser Asp Val Tyr He Thr Asp Lys Thr Val Leu 
145 150 155 

Asp Met Arg Ser Met Asp Phe Lys Ser Asn Ser Ala Val Ala Trp Ser 
160 165 170 

Asn Lys Ser Asp Phe Ala Cys Ala Asn Ala Phe Asn Asn Ser He He 

180 185 190 

Pro Glu Asp Thr Phe Phe Pro Ser Pro Glu Ser Ser Cys Asp Val Lys 

195 200 205 

Leu Val Glu Lys Ser Phe Glu Thr Asp Thr Asn Leu Asn Phe Gin Asn 
210 215 . 220 

Leu Ser Val He Gly Phe Arg He Leu Leu Leu Lys Val Ala Gly Phe 
225 230 235 



1241 
1301 
1341 



Asn Leu Leu Met Thr Leu Arg Leu Trp Ser Ser 
240 245 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) POSITION: 1..933 

* (ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) POSITION: 1..63 

(iX) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) POSITION: 64.. 933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GAT ACC TGG CTC GTA TGC TGG GCA ATT TTT AGT CTC TTG AAA GCA 
Met Asp Thr Trp Leu Val Cys Trp Ala He Phe Ser Leu Leu Lys A^a 
zx ~ zu -15 -10 

GGA CTC ACA GAA CCT GAA GTC ACC CAG ACT CCC AGC CAT CAG GTC ACA 
Gly Leu Thr Glu Pro Glu Val Thr Gin Thr Pro Ser His Gin Val Thr 
5 1 5 10 

CAG ATG GGA CAG GAA GTG ATC TTG CGC TGT GTC CCC ATC TCT AAT CAC 
Gin Met Gly Gin Glu Val He Leu Arg Cys Val Pro He ITr £s*n Hil 
15 20 25 

7Z» l AC 11° m AT TGG TAC AGA CAA ATC TTG GGG CAG AAA GTC GAG TTT 
Leu Tyr Phe Tyr Trp Tyr Arg Gin He Leu Gly Gin Lys Val Glu Phe 

30 35 40 

CTG GTT TCC TTT TAT AAT AAT GAA ATC TCA GAG AAG TCT GAA ATA TTC 
Leu Val ser Phe Tyr Asn Asn Glu He Ser Glu Lys Ser Glu lie Phe 
45 50 55 

GAT GAT CAA TTC TCA GTT GAA AGG CCT GAT GGA TCA AAT TTC ACT CTG 
Asp Asp Gin Phe Ser Val Glu Arg Pro Asp Gly Ser ktn 111 Thl Hu 

65 70 75 

AAG ATC CGG TCC ACA AAG CTG GAG GAC TCA GCC ATG TAC TTC TGT GCC 
Lys lie Arg Ser Thr Lys Leu Glu Asp Ser Ala Met Tyr Phe Cys 111 

80 85 90 

AGC AGC GAA ACT AAC TCC TAC GAG CAG TAC TTC GGG CCG GGC ACC AGG 
Ser Ser Glu Thr Asn Ser Tyr Glu Gin Tyr Phe Gly Pro Gly Thr Arc 
95 100 105 

LeS Thr vll Thr C^n ? TG ?** AAC GTG TTC CCA CCC GAG G *C GCT 

Leu Thr yal Thr Glu Asp Leu Lys Asn Val Phe Pro Pro Glu Val Ala 

110 115 120 



48 



96 



144 



192 



240 



288 



336 



384 



432 



GTG TTT GAG CCA TCA GAA GCA GAG ATC TCC CAC ACC CAA AAG GCC ACA 
Val Phe Glu Pro Ser Glu Ala Glu He Ser HiS g^ ^2 A?a T^r 

130 135 

CTG GTG TGC CTG GCC ACA GGC TTC TAC CCC GAC CAC GTG GAG CTG AGC 
Leu Val Cys Leu Ala Thr Gly Phe Tyr Pro Asp His Val Glu Leu sir 

145 150 . . 155 

III III SI? J** 2?° ^ G G ? G GTG CAC AGT GGG GTC AGC A CA GAC CCG 
Trp Trp Val Asn Gly Lys Glu Val His Ser Gly Val Ser Thr Asp Pro 

160 165 17 q 

CAG CCC CTC AAG GAG CAG CCC GCC CTC AAT GAC TCC AGA TAC TGC CTG 
Gin Pro Leu Lys Glu Gin Pro Ala Leu Asn Asp Ser Arg ?y? cyt leu 
175 180 185 

q2J Hi ? GC ? TG AGG GTC TCG GCC ACC TTC TGG CAG AAC CCC CGC AAC 
Ser ser Arg Leu Arg Val Ser Ala Thr Phe Trp Gin Asn Pro Arg As^ 

190 195 200 

CAC TTC CGC TGT CAA GTC CAG TTC TAC GGG CTC TCG GAG AAT GAC GAG 
His Phe Arg Cys Gin Val Gin Phe Tyr Gly Leu Ser oil tel Hp ail 

2X5 

III Thl Gin AGG G f C ?** CCT GTC ACC CAG ATC GTC AGC GCC GAG 

Trp Thr Gin Asp Arg Ala Lys Pro Val Thr Gin He Val Ser Ala Glu 

225 230 235 

A?a Jro G^v Ala G f A GAC ^ T G ? C TTC ACC TCC GAG TCT T AC CAG CAA 
Ala Trp Gly Arg Ala Asp Cys Gly Phe Thr Ser Glu Ser Tyr Gin Gin 

240 245 250 

r?S S^f f TG e CT GCC ACC ATC CTC TAT GAG ATC TTG CTA GGG AAG GCC 
Gly Val Leu Ser Ala Thr He Leu Tyr Glu He Leu Leu Gly Lys Al 



260 265 



a 



624 



672 



720 



768 



816 



864 



912 



ACC TTG TAT GCC GTG CTG GTC AGT GCC CTC GTG CTG ATG GCC ATG GTC 
Thr Leu Tyr Ala Val Leu Val Ser Ala Leu Val Leu Se? A°a Met vll 

270 275 280 

AAG AGA AAG GAT TCC AGA GGC TAG 

Lys Arg Lys Asp Ser Arg Gly 936 
285 290 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asp Thr Trp Leu Val Cys Trp Ala He Phe Ser Leu Leu Lys Ala 

-15 -10 

Gly Leu Thr Glu Pro Glu Val Thr Gin Thr Pro Ser His Gin Val Thr 

1 .5 10 

Gin Met Gly Gin Glu Val He Leu Arg Cys Val Pro He Ser Asn His 



20 



25 



Leu Tyr Phe Tyr Trp Tyr Arg Gin He Leu Gly Gin Lys Val Glu Phe 
30 35 40 

Leu Val Ser Phe Tyr Asn Asn Glu He Ser Glu Lys Ser Glu He Phe 
45 50 55 

Asp Asp Gin Phe Ser Val Glu Arg Pro Asp Gly Ser. Asn Phe Thr Leu 
60 65 70 75 

Lys He Arg Ser Thr Lys Leu Glu Asp Ser Ala Met Tyr Phe Cys Ala 

80 85 90 

Ser Ser Glu. Thr Asn Ser Tyr Glu Gin Tyr Phe Gly Pro Gly Thr Ara 
95 100 105 

Leu Thr Val Thr Glu Asp Leu Lys Asn Val Phe Pro Pro Glu Val Ala 
110 115 120 

Val"phe Glu Pro Ser Glu Ala Glu He Ser His Thr Gin Lys Ala Thr 
125 . 130 135 

Leu Val Cys Leu Ala Thr Gly Phe Tyr Pro Asp His Val Glu Leu Ser 
140 145 150 155 

Trp Trp Val Asn Gly Lys Glu Val His Ser Gly Val Ser Thr Asp Pro 

160 165 170 

Gin Pro Leu Lys Glu Gin Pro Ala Leu Asn Asp Ser Arg Tyr Cys Leu 
175 180 185 

Ser Ser Arg Leu Arg Val Ser Ala Thr Phe Trp Gin Asn Pro Arg Asn 
190 195 200 

His Phe Arg Cys Gin Val Gin Phe Tyr Gly Leu Ser Glu Asn Asp Glu 
205 210 215 

Trp Thr Gin Asp Arg Ala Lys Pro Val Thr Gin He Val Ser Ala Glu 
220 225 230 235 

Ala Trp Gly Arg Ala Asp Cys Gly Phe Thr Ser Glu Ser Tyr Gin Gin 

240 245 250 

Gly Val Leu Ser Ala Thr He Leu Tyr Glu He Leu Leu Gly Lys Ala 
255 260 265 

Thr Leu Tyr Ala Val Leu Val Ser Ala Leu Val Leu Met Ala Met Val 
270 275 280 

Lys Arg Lys Asp Ser Arg Gly 
285 290 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 



(B) POSITION : 1..3 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGC CTC GTC CTT TCT GGT TCT GCA AGG CAA CTG ACC TTT 
Cys Leu Val Leu Ser Gly Ser Ala Arg Gin Leu Thr Phe 

295 300 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Cys Leu Val Leu Ser Gly Ser Ala Arg Gin Leu Thr Phe 
15 10 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) POSITION: 1..3 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TGC CTC GCT ACT GGT TCT GCA AGG CAA CTG ACC TTT 
Cys Leu Ala Thr Gly Ser Ala Arg Gin Leu Thr Phe 
15 20 25 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Leu Ala Thr Gly Ser Ala Arg Gin Leu Thr Phe 

5 10 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

, (A) LENGTH: 39 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 



(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..39 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGT GCC AGC AGT GGA ACA GAT TCC TAC GAG CAG TAC TTC 
Cys Ala Ser Ser Gly Thr Asp Ser Tyr Glu Gin Tyr Phe 
15 20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii). TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Ala Ser Ser Gly Thr Asp Ser Tyr Glu Gin Tyr Phe 
15 10 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TGT GCC AGC AGT GAA ACA GAT TCC TAC GAG CAG TAC TTC 
Cys Ala Ser Ser Glu Thr Asp Ser Tyr Glu Gin Tyr Phe 
15 20 * 25 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Ala Ser Ser Glu Thr Asp Ser Tyr Glu Gin Tyr Phe 

5 10 



(2) INFORMATION FOR SEQ ID NO: 



13: 



(i) SEQUENCE CHARACTERISTICS • 

til ^ GTH: 39 base P^rs 

(B) type: nucleotide 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: I..39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 : 

rtl ?? C c GC o GT GGA ACA GCT TCC TAC GAG CAG TAC TTC 
cys A^a ser Ser Gly Thr Ala Ser Tyr Glu Gin Tyr Phe 
^ 20 * 25 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Ala Ser Ser Gly Thr Ala Ser Tyr Glu Gin Tyr Phe 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

f ff c GC e GT GGT ACA ^ TCC TAC GAG CAG TAC TTT 
cys Ala ser Ser Gly Thr Asn Ser Tyr Glu Gin Tyr Phe 

15 20 25 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Cys Ala Ser Ser Gly Thr Asn Ser Tyr Glu Gin Tyr Phe 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) POSITION: 1..39 

V- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TGT GCC ACC TCC GGG ACA GCT TCC TAC GAG CAG TAC TTC 
Cys Ala Thr Ser Gly Thr Ala Ser Tyr Glu Gin Tyr Phe 
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Cys Ala Thr Ser Gly Thr Ala Ser Tyr Glu Gin Tyr Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) POSITION: 1..39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TGT GCC AGA TCC GGG ACA GGC TCC TAC GAG CAG TAC TTC 
Cys Ala Arg Ser Gly Thr Gly Ser Tyr Glu Gin Tyr Phe 
15 20 25 



▼ — ^^^^^ 

x (2) INFORMATION FOR SEQ ID NO: 20: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:. 

Cys Ala Arg Ser Gly Thr Gly Ser Tyr Glu Gin Tyr Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



CACTGAAGAT CCATCATCTG 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



TAGAGGATGG TGGCAGACAG 



